Using Universal Linguistic Knowledge to Guide Grammar Induction
Abstract
We present an approach to grammar induction that utilizes syntactic universals to improve dependency parsing across a range of languages. Our method uses a single set of manually specified, language-independent rules that identify syntactic dependencies between pairs of syntactic categories that commonly occur across languages. During inference of the probabilistic model, we use posterior expectation constraints to require that a minimum proportion of the dependencies we infer be instances of these rules. We also automatically refine the syntactic categories given in our coarsely tagged input. Across six languages our approach outperforms state-of-the-art unsupervised methods by a significant margin.
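The expectation constraint described in the abstract can be illustrated with a minimal sketch. It only checks whether a set of edge posteriors meets a minimum expected proportion of rule-matching dependencies; it does not perform the constrained inference itself. The rule set, tag names, function names, and threshold values are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical (head_tag, child_tag) rules, for illustration only.
# These are NOT the rules used in the paper.
UNIVERSAL_RULES: Set[Tuple[str, str]] = {
    ("VERB", "NOUN"),
    ("VERB", "ADV"),
    ("NOUN", "ADJ"),
    ("NOUN", "DET"),
}

Edge = Tuple[int, int]  # (head index, child index) into the tag sequence


def expected_rule_proportion(tags: List[str],
                             edge_posteriors: Dict[Edge, float]) -> float:
    """Expected share of dependency mass that falls on rule-matching edges."""
    total = sum(edge_posteriors.values())
    if total == 0:
        return 0.0
    matched = sum(
        prob
        for (head, child), prob in edge_posteriors.items()
        if (tags[head], tags[child]) in UNIVERSAL_RULES
    )
    return matched / total


def satisfies_constraint(tags: List[str],
                         edge_posteriors: Dict[Edge, float],
                         min_proportion: float) -> bool:
    """True if the expected proportion meets the (assumed) minimum threshold."""
    return expected_rule_proportion(tags, edge_posteriors) >= min_proportion


# Toy sentence with made-up posteriors over candidate edges.
tags = ["DET", "NOUN", "VERB"]
edge_posteriors = {(1, 0): 0.9, (2, 1): 0.8, (2, 0): 0.1, (0, 1): 0.2}
proportion = expected_rule_proportion(tags, edge_posteriors)
```

In this toy input, the NOUN to DET and VERB to NOUN edges match the assumed rules, so most of the posterior mass counts toward the constraint. A full system would adjust the posteriors during inference until the threshold is met, rather than only testing it.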
Related resources
Nature, Nurture and Universal Grammar
In just a few years, children achieve a stable state of linguistic competence, making them effectively adults with respect to: understanding novel sentences, discerning relations of paraphrase and entailment, acceptability judgments, etc. One familiar account of the language acquisition process treats it as an induction problem of the sort that arises in any domain where the knowledge achieved ...
Using Left-corner Parsing to Encode Universal Structural Constraints in Grammar Induction
Center-embedding is difficult to process and is known to be a rare syntactic construction across languages. In this paper we describe a method to incorporate this assumption into grammar induction tasks by restricting the search space of a model to trees with limited center-embedding. The key idea is the tabulation of left-corner parsing, which captures the degree of center-embedding of a parse...
Understanding stimulus poverty arguments (Janet Dean Fodor and Carrie Crowther)
The argument from the poverty of the stimulus as Pullum and Scholz define it (their APS) is undeniably true, given that all language learners acquire the ability to generate more sentences of the target language than they have heard. Uniformity across learners with respect to the additional sentences they project suggests that grammar induction is guided by general principles, which must be inna...
Relationship between Iranian EFL High School Students’ Knowledge of Universal Grammar and their Performance on Standardized General English Proficiency Tests
This study investigated the relationship between Iranian high school students’ Universal Grammar knowledge and their performance on standardized general English proficiency tests such as PET and FCE, which are administered internationally by Cambridge University. To this end, 108 students were randomly chosen from several high schools in Malayer, Hamedan. Since this study was correlational in nat...
Simple Robust Grammar Induction with Combinatory Categorial Grammars
We present a simple EM-based grammar induction algorithm for Combinatory Categorial Grammar (CCG) that achieves state-of-the-art performance by relying on a minimal number of very general linguistic principles. Unlike previous work on unsupervised parsing with CCGs, our approach has no prior language-specific knowledge, and discovers all categories automatically. Additionally, unlike other appr...